Leveraging Geo-Referenced Digital Photographs Thesis Introduction
نویسنده
چکیده
As the photography world shifted from film cameras into digital cameras, computers now play a significant role in managing people’s photographs or, if you will, memories. Photos are stored, shared, searched and viewed — all in digital format. Managing large personal collections of digital photographs is an increasingly difficult task. As the rate of digital acquisition rises, storage becomes cheaper, and “snapping” new pictures gets easier, we are inching closer to Vannevar Bush’s 1945 Memex vision [2] of storing a lifetime’s worth of documents and photographs. At the same time, the usefulness of the collected photos is in doubt, given that the methods of access and retrieval are still limited. With digital photos, the opportunity to go “beyond the shoebox” is attractive, yet still not entirely fulfilled. One of the major hurdles for computer-based photo applications is the semantic gap. The semantic gap is defined by Smeulders et al. [8] as “the lack of coincidence between the information that one can extract from the visual data and the interpretation that the same data have for a user in a given situation.” Given perfect semantic knowledge about the photos, the task of organizing and retrieving from a photo collection would be made much easier. For example, if a system could automatically derive that a photo shows “Kimya drinking with Dylan at Robyn’s birthday, in New York”, this semantic knowledge could go a long way in helping users manage and retrieve from their collection. Sadly, current technology sometimes cannot even reliably detect that there are two people in the photo just described. The existing approaches towards photo collection management can be categorized into four main thrusts. First, there are tools to enable and ease manual annotation (e.g., [7]). These tools let the user rapidly enter semantic informa-
منابع مشابه
Leveraging Geo-referenced Digital Photographs a Dissertation Submitted to the Department of Computer Science and the Committee on Graduate Studies of Stanford University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy
Given automatically captured metadata such as time and location about photos in a personal collection, we devised a series of methods for supporting photo management. These methods allow an enhanced level of semantic interaction with photo collections, while requiring little or no effort from the collection’s owner. This work describes how we automatically organize such geo-referenced collectio...
متن کاملAdventures in Space and Time: Browsing Personal Collections of Geo-Referenced Digital Photographs
We evaluate two novel applications for browsing personal collections of geo-referenced digital photographs. The first, PhotoCompas, is a browser that employs no graphical user interface elements other than the photos themselves (textual browser). PhotoCompas was developed in our project, and is based on an automated organization of the respective photo collection into clustered locations and ev...
متن کاملAssigning textual names to sets of geographic coordinates
NameSet is a system that translates a set of geographic coordinates into a textual name based on the geographic regions where the coordinates occur. One possible application of NameSet is to concisely present the geographical scope of a set of geo-referenced observations to a human user. Another application is to generate text to depict a set of coordinates that appear on a web site – text that...
متن کاملMetropogis: a Semi-automatic City Documentation System
In this paper we report on a new system to augment a 3D block model of a real city with geo-referenced terrestrial images of the facades. The terrestrial images are taken by a hand-held digital consumer camera using short baselines. The relative orientation of the photographs is calculated automatically and fitted towards the 3D block model with minimized human input using vanishing points. The...
متن کاملMetropogis: a city information system
In this paper we report on a new system to augment a 3D block model of a real city obtained from aerial photogrammetry or aerial laser scanning with geo-referenced terrestrial data of the facades. The terrestrial images are acquired by a hand-held digital consumer camera. The relative orientation of the photographs is calculated automatically and fitted towards the 3D block model with minimized...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005